Institute of Information and Communication Technologies
نویسندگان
چکیده
We explore lexical choice in Natural Language Generation (NLG) by implementing a model that uses both context and frequency information. Our model chooses a lemma given a WordNet synset in the abstract representations that are the input for generation. In order to find the correct lemma in its context, we map underspecified dependency trees to Hidden Markov Trees that take into account the probability of a lemma given its governing lemma, as well as the probability of a word sense given a lemma. A tree-modified Viterbi algorithm is then utilized to find the most probable hidden tree containing the most appropriate lemmas in the given context. Further processing ensures that the correct morphological realization for the given lemma is produced. We evaluate our model by comparing it to a statistical transfer component in a Machine Translation system for English to Dutch. In this set-up, the word sense of words are determined in English analysis, and then our model is used to select the best Dutch lemma for the given word sense. In terms of BLEU score, our model outperforms a most frequent baseline, in which the most frequent lemma of a given word sense is always chosen. A manual evaluation confirms that our model is able to select the correct lemma when it is given a correct input synset. The majority of errors were caused by incorrect assignment of the word sense in the English analysis phase. Our model does not improve upon a transfer component trained on a parallel corpus. In the original transfer component, there barely are any lemmas that were incorrectly translated in the transfer phase, with the exception of Out of Vocabulary items (OOV’s). In a further experiment we only used our model for OOV’s and obtained a small improvement in BLEU score.
منابع مشابه
Application of Big Data Analytics in Power Distribution Network
Smart grid enhances optimization in generation, distribution and consumption of the electricity by integrating information and communication technologies into the grid. Today, utilities are moving towards smart grid applications, most common one being deployment of smart meters in advanced metering infrastructure, and the first technical challenge they face is the huge volume of data generated ...
متن کاملImproving QoS in VANETs: A Survey
The systems in which information and communication technologies and systems engineering concepts are utilized to develop and improve transportation systems of all kinds are called “The Intelligent Transportation Systems (ITS)”. ITS integrates information, communications, computers and other technologies and uses them in the field of transportation to build an integrated system of people, roads ...
متن کاملInvestigating the Impact of Information Literacy and Information Communication Technologies on Knowledge Sharing among Public Library Librarians
Purpose. Information literacy and information communication technologies are the determining factors in today's ever-changing conditions. In this study, the relationship between information literacy and knowledge sharing with the mediating role of information communication technologies among public library staff has been studied to determine the role of these two factors in knowledge sharing by...
متن کاملRole of ICT in Sustainable Urban Development by Using Model SWOT
Information and communication technologies are the most recent scientific achievements of mankind's ability seems to have much to offer society and are expected to be useful in solving the problems of human society. Many around the world believe that accelerates the process of adjustment in the exchange of knowledge and information through information and communication technologies a vital role...
متن کاملRole of ICT in Sustainable Urban Development by Using Model SWOT
Information and communication technologies are the most recent scientific achievements of mankind's ability seems to have much to offer society and are expected to be useful in solving the problems of human society. Many around the world believe that accelerates the process of adjustment in the exchange of knowledge and information through information and communication technologies a vital role...
متن کاملConstraints to Effective Use of Information Communication Technologies (ICTs) among Small-scale Farmers in Anambra State, Nigeria
The study was carried out in Anambra State, Nigeria. Questionnaire was used to collect data from a sample of one hundred and eight (108) small-scale farmers. Percentage, mean score, standard deviation and factor analysis were used for data analysis. The duration of the study was June 2009 and March, 2010. Results of the study indicated that the major constraints to effective use of ICTs by smal...
متن کامل